An Integrated Environment for High-dimensional Geographic Data Mining

نویسندگان

  • Diansheng Guo
  • Mark Gahegan
  • Alan M. MacEachren
چکیده

Introduction Geographic data are often very large in volume and “characterized by a high number of attributes or dimensions” [1]. There are urgent needs to develop effective and yet efficient approaches for analyzing such voluminous and high-dimensional data to address complex geographic problems [1, 2, 3, 4], e.g., detecting unknown multivariate patterns or relationships between socioeconomic, demographic, environmental factors and the incidence of various cancers. This paper introduces an integrated geographic data mining environment, which couples a suite of visualization and computational methods to explore multivariate patterns in large and high-dimensional geographic datasets. The integrated geographic data mining environment involves four major groups of components: (1) interactive feature selection components to identify interesting subsets of variables for further analysis[5]; (2) self-organizing map (SOM) [6] components to cluster data objects with only the variables selected above; (3) a high-dimensional visualization component—Parallel Coordinate Plot (PCP) [7]—to explore and present multivariate patterns or relationships; and (4) a geographic map component to visualize the spatial distribution of patterns discovered above. With interactive manipulation of these integrated components, the user can iteratively locate, interpret, and refine patterns.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Geostatistical Approaches for Geovisual Data Exploration, Analysis and 3D-Visualisation in Civil Security

This contribution presents selected approaches, methods and tools to facilitate geovisual analytical data exploration for civil security purposes. To analyse large emergency service data of a major German city’s fire department, different data mining techniques are applied. This allows identifying statistical significant clusters in space and time. To facilitate convenient methods for exploring...

متن کامل

Extend Table Lens for High-Dimensional Data Visualization and Classification Mining

Data mining and information visualization apply different techniques to solve the same problem of extracting useful information hidden in large amount of data. In this project, we focus on classification problem and developed an integrated information visualization environment, which plugs in classification rule-mining methods and extends the Table Lens techniques to handle high-dimensional dat...

متن کامل

GIS modelling for Au-Pb-Zn potential mapping in Torud-Chah Shirin area-Iran

One of the major strengths of a Geographic Information System (GIS) in geosciences is the ability to integrate and combine multiple layers into mineral potential maps showing areas which are favorable for mineral exploration. These capabilities make GIS an extremely useful tool for mineral exploration. Several spatial modeling techniques can be employed to produce potential maps. However, these...

متن کامل

An Integrated Baseline Geodatabase for Facilitating the Environmental Impact Assessment Process: Case Study of Sabalan Geothermal Project, Iran

Baseline data represent one of the important stages of Environmental Impact Assessment (EIA) procedure that describes the existing environment of the study area and surrounding areas in enough detail to allow the environmental impacts of the proposed area to be accurately and adequately assessed, and future changes and effects can be measured. Baseline data may be inaccurate, difficult to obtai...

متن کامل

Calculation of One-dimensional Forward Modelling of Helicopter-borne Electromagnetic Data and a Sensitivity Matrix Using Fast Hankel Transforms

The helicopter-borne electromagnetic (HEM) frequency-domain exploration method is an airborne electromagnetic (AEM) technique that is widely used for vast and rough areas for resistivity imaging. The vast amount of digitized data flowing from the HEM method requires an efficient and accurate inversion algorithm. Generally, the inverse modelling of HEM data in the first step requires a precise a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004